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f^ Abstract 

o 

^vq We initiate a systematic study of eigenvectors of random graphs. Whereas much is known 

about eigenvalues of graphs and how they reflect properties of the underlying graph, relatively 
little is known about the corresponding eigenvectors. Our main focus in this paper is on the 
nodal domains associated with the different eigenfunctions. In the analogous realm of Laplacians 
of Riemannian manifolds, nodal domains have been the subject of intensive research for well 
^N over a hundred years. Graphical nodal domains turn out to have interesting and unexpected 

^_^ properties. Our main theorem asserts that there is a constant c such that for almost every 

n^ graph G, each eigenfunction of G has at most two large nodal domains, and in addition at 

Q, most c exceptional vertices outside these primary domains. We also discuss variations of these 

questions and briefly report on some numerical experiments which, in particular, suggest that 
almost surely there are just two nodal domains and no exceptional vertices. 



> 
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' — ' 1 Introduction 

^ Let G be a graph and let A be its adjacency matrix. The eigenvalues of A turn out to encode a 

•^ good deal of interesting information about the graph G. Such phenomena have been intensively 

^-^ investigated for over half a century. We refer the reader to the book |14l Ch. 11] for a general 

ff^ discussion of this subject and to the survey article [10] for the connection between eigenvalues and 

r^ expansion. Strangely, perhaps, not much is known about the eigenvectors of A and how they are 

C^ related to the properties of G. However, in many application areas such as machine learning and 

computer vision, eigenvectors of graphs are being used with great success in various computational 

tasks such as partitioning and clustering. For example, see the work of Shi and Malik [19], Coifman, 

et. al. [316], Pothen, Simon and Liou [TB] and others. In particular, a basic technique for spectral 

/\ partitioning (e.g. Weiss ^24J) involves splitting a graph according to its nodal domains. As far as 

jrt we know, the success of these methods has not yet been given a satisfactory theoretical explanation 

and we hope that our investigations will help in shedding some light on these issues as well. We also 

mention that nodal domain counts in graphs are relevant to various studies in statistical physics; 

see the survey p]. 

There is, on the other hand, a rich mathematical theory dealing with the spectrum and eigen- 
functions of Laplacians on manifolds. We only mention this important and highly relevant back- 
ground material and refer the reader who wants to know more about this theory to Chapter 8 in 
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Marcel Berger's monumental panorama of Riemannian Geometry [3]. Suffices it to say that the 
adjacency matrix of a graph is a discrete analogue of the Laplacian (we say a bit more about other 
analogs below). The geometric perspective of graphs is, in our opinion, among the most promising 
and most exciting aspects of present-day graph theory. Among the recent successes of this point of 
view is the metric theory of graphs and its computational applications. The success of the metric 
theory notwithstanding, we believe that other areas of geometry can be incorporated into this line 
of development. 

The only necessary facts we require are that the (geometric) Laplacian has a discrete spectrum, 
that its smallest eigenvalue is zero and that the corresponding eigenfunction is the constant function. 
This is analogous to the fact that the first eigenvector of a finite connected graph is a positive vector 
and in particular, if the graph at hand is d-regular, then its first eigenvalue is d and that in the 
corresponding eigenvector all coordinates are equal. 

Nodal domains of eigenfunctions of the Laplacian have been studied in depth for more than a 
century. We will only discuss this concept in the realm of graphs and refer the interested reader 
to [3] for further information about the geometric setting. So what are nodal domains? Let G 
be a finite connected graph. It is well-known that every eigenfunction / but the first takes both 
positive and negative values. These values induce a partition of the vertex set V{G) into maximal 
connected components on which / does not change its sign; these are the nodal domains of / (see 
below for the precise definition). 

We maintain the following convention: If G is an n-vertex graph, we denote the eigenvalues of 
its adjacency matrix by Ai > A2 > . . .. We simply refer to the Aj as the eigenvalues of G and let 
/i,/2, ... be the corresponding eigenfunctions, normalized in £2- A slight adaptation of classical 
theorem due to Courant (see, for example, ||4j) shows that for every k, the eigenfunction /^ has at 
most k nodal domains. This statement is a bit inaccurate and we refer the reader to [8] for a full 
account of Courant's theorem for graphs. 

We impose throughout some fixed (but arbitrary) ordering on the vertex set F = {vi,V2, ■ ■ ■ ,Vn}, 
andthe coordinates in eigenvectors of G are arranged in this order. We freely interchange between 
the vector (/(fi), . . . , f{vn)) and the corresponding function / : 1/ — > M. In the graph setting, one 
has to be careful in defining nodal domains properly. 

Definition 1.1. Let G = (V^E) be a graph and let f : V ^ M be any real function. A subset 
D CV is a weak nodal domain of / if it is a maximal subset of V subject to the two conditions 

1. D is connected, and 

2. if x,y £ D then f{x)f{y) > 0. 

We say that D is a strong nodal domain if (ii) is replaced by 
2\ ifx,y£D, thenf{x)f{y)>0. 

The main focus of our research is the following problem. 

Question 1. How many nodal domains (strong or weak) do the eigenvectors of G tend to have for 
G that comes from various random graph models? 

To get some initial idea, we started our research with a numerical experiment whose outcomes 
were quite unexpected. We sampled numerous graphs from the random graph space G(n, 2)- It 



d=4 d=5 



Figure 1: The number of nodal domains in a random d-regular 300 vertex graph. There are y nodal domains 
corresponding to the x'th eigenvector (eigenvalues are sorted). For each d we show the average and standard 
deviation of 100 such random graphs. 

turned out that in each and every one of these cases, all eigenvectors of the adjacency matrix had 
exactly two nodal domains. (In the experiments, no eigenvector ever had a coordinate, so the 
notions of strong and weak domains are equivalent.) The same experimental phenomenon was 
observed for several smaller values of p > in the random graph model G{n,p), provided that n is 
large enough. Even more unexpected were the results obtained for random regular graphs. Some 
of these results are shown in Fig. [T| We found that quite a few of the first eigenvectors have just 
two nodal domains, and only then the number of nodal domains starts to grow. 

As mentioned above, there are other discrete analogs to the geometric Laplacian. One often 
considers the so-called combinatorial Laplacian of a graph G. This is the matrix D — A where A 
is G's adjacency matrix and D \s a diagonal matrix with Da being the degree of the vertex Vi. 
It is well known that this matrix is positive semidefinite, and it is also of interest to investigate 
similar questions for the combinatorial Laplacian of random graphs. The convention here is that 
the eigenvalues are sorted as = /ii < . . . < //„• For regular graphs, the question for the adjacency 
matrix and for the discrete Laplacian are equivalent, since Aj + pLn-i+i = d for every i and the 
eigenfunction corresponding to Aj and to fin-i+i in the two matrices are identical. However, for 
graphs in G{n, p) it turns out that the two spectra behave slightly differently. Computer simulations 
suggest that all eigenvectors of the combinatorial Laplacian that correspond to /i2 • ■ • A*n-A for some 
small A have exactly two nodal domains. However, among the last A eigenfunctions, some have 
three nodal domains. Fig. 2 suggests that in a constant fraction of the graphs in G (n, 2) the 
eigenvector corresponding to fin has three nodal domains. 

Our main theorem for G{n,p) partly establishes these observed phenomena. 

Theorem 1.2. For every p G (0,1), if G ^ G{n,p), then asymptotically almost surely the following 
holds for every eigenvector of G. The two largest weak nodal domains cover all vertices in G with 
the exception of at most Op(l) vertices, where Op(l) represents a constant depending only on p. 
The two largest strong nodal domains cover all the vertices {v £ V : f{v) / 0}, with the exception 
of at most Op{l) vertices. In particular, in both cases every eigenvector of G has at most Op(l) 
(strong or weak) nodal domains. 

We remark that our bounds are quite reasonable (see Appendix A). For instance, for p = 1/2, 
we show that there are at most 46 exceptional vertices almost surely. 
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Figure 2: The probability in G{n, ^) that the last eigenvector of the Laplacian has three nodal domains. 
For each n there were 500 experiments carried out. 



1.1 Overview of our approach 

Given G ~ G{n,p) and an eigenvector / of G, we first partition V{G) = VjUMfUSf where Vf,Aff 
are the largest positive and negative nodal domains of /, respectively, and £f is a set of exceptional 
vertices. All eigenvectors are normalized to be unit vectors in £2- 

We show that if £f is large, then we can use the eigenvalue condition, combined with upper 
estimates on the eigenvalues of G{n,p) to find a large subset S of coordinates for which ||/|s'||2 is 
smaller than one would expect for a random unit vector / € S*"^^. The last step is to show that the 
probability that some eigenvector has small 2-norm on a large set of coordinates is exponentially 
small, and then apply a union bound over all such subsets. 

The problem is that the final step seems to require very strong upper bounds on HA^-*^!! (i.e. 
lower bounds on the smallest singular value of A) for a random discrete matrix A. Although there 
has been a great deal of progress in this direction [131 US] 13 EH [71 [18] (see also the survey j23] ) , 
the known bounds for symmetric matrices are far too weak for our purpose. Thus it is crucial that 
we reduce to proving upper bounds on ||^^^|| when A is a random rectangular ±1 Bernoulli matrix. 
In this case, we can employ the optimal bounds of |121 [T^ which also yield the exponential failure 
probability that we require. This reduction is carried out in Theorem |3.1[ 



In Section 3.3 we show that it is possible to get significantly better control on the nodal domains 
of G{n,p) if we could get slightly non-trivial bounds on the ioo norms of eigenvectors. This is a 
possible avenue for future research. 



1.2 Preliminaries 

Notation 1.1. We denote by G{n,p) a random graph with n vertices, where each edge is chosen 
independently with probability p. The set of neighbors of a vertex x G V{G) is denoted by T[x). 

Notation 1.2. For a graph G = {V^E) a function / : y — > M, and any subset S '^ V , we 
denote f{S) = J2yeS f(y)- ^'"' particular, for every x £ V we denote f{T{x)) = Yliyr^xf^v) ~ 
^y^y Axyf{y), where A is the adjacency matrix of G. 



As usual, the inner product oi f,g -.V 



define 



i/p 



(E.eyl/(^)r) >and 



R is denoted {f,g) = Y^xeV fi^)9{x). For p > 1, we 
maxa;gy |/(3;)|. 



Definition 1.3. For p G [0,1], we define the random variable Xp by 

\ p — 1 with probability p 



I p with probability 1 — p. 



In particular, KXp = 0. 



Definition 1.4. Let Mmxkip) be the m x k matrix whose entries are independent copies of Xp, 
and let M^"^{p) be the symmetric k x k matrix whose entries above the diagonal are independent 
copies of Xp, and whose diagonal entries are p. 

Unless otherwise stated, throughout the manuscript, all eigenvectors are assumed to be normal- 
ized in H.2- When A is the adjacency matrix of a graph, we arrange its eigenvalues as Ai > A2 > 
... > Xn- The eigenvector corresponding to Ai is called the first eigenvector, and any other is a 
non-first eigenvector. 

As is customary in this area, we occasionally say "almost surely" as shorthand for "asymptot- 
ically almost surely." All asymptotic statements are made for large n and it is always implicitly 
assumed that n is large enough. 

2 Spectral properties of random matrices 

We now review some relevant properties of random matrices. 

Theorem 2.1 (Tail bound for symmetric matrices). If A ^ M^^(p), then for every ^ > 

Pr(p|| > {2^/p{l-p)+0^) < 4e-(i-"«)«''=/« 

Here \\A\\ stands for the £2 operator norm of A. 

Proof. Fiiredi and Komlos prove in [9, §3.3] that the expected value of the largest magnitude 
eigenvalue of A is at most (2 + o{l))y/p{l —p)\/k. Alon, Krivelevich and Vu prove in [1] (see also 
[Tlj ) that the probability that the largest eigenvalue of A exceeds its median by ^vfc is at most 
2e~? ^/^ ^ and so is the probability that the smallest eigenvalue of A is smaller than its median 
by ^Vk. As usual in the context of sharp concentration, the expected value of the first and last 
eigenvalues differs from the median by at most 0(1). The conclusion follows. D 

We also need a similar bound for A ~ Mmxk{p) whose proof is standard; we repeat it below in 
order to record the exact dependence of the constants on p. 

Theorem 2.2 (Tail bound for non-symmetric matrices). For any m>k, if A ^ ^rnxkip) then 

Pr ( ll^ll > CLi\/p{\ — p)^/m\ < e' 

where 01,02 are constants that depend only on p. 
We first recall that Xp, has a subgaussian tail. 
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Lemma 2.3. For every < p < ^, 

For every g < p < 1, 

Proof. Note that 

Ee*^^ = pe-*(i-P) + (1 - p)e*P. 

In both cases, the claim follows by comparing the Taylor series on both sides. D 

Proof of Thm. \2^ Let A^i C S''^^ and N2 C 5™"^ be |-nets. By standard estimates, \Ni\ < 
9M7V2I <9™. 

Fix some x € Ni and y G N2, and estimate Pr((y, j4x) > ty/m) as follows 

Pr {{y,Ax) > tV^) = Pr fe^<^'^^'> > e^*v^j < \^ 



Define q = max{p, 1 — p}. By Lemma 2.3 we have 



k m 
]ggA(j/,Ax> ^ TT TT Ee'^^'J'^*^^' 

k m 

The optimal choice is A = -^ which yields 

Pr ((y, Ax) > t-y/m) < e ^^ 
and therefore 

2 2 

t rn t m 

FI{3xeNl,yeN2:\{y,Ax)\>t^Jm)<2e ^7^|A^i||iV2| < 2e ^?9^+™. 

By successive approximation, express x G S^~^ as x = J2i>o^i^i where for all i, Xi E Ni and 
\cii\ < 3~*. Likewise for y e S''""-'^ and y = Yli>of^iyi with yj G iV2 and |/3i| < 3~*. Therefore, if 
\{yi,Axj)\ < ty/m for all Xj G A^i and yi £ N2, then 

|(y,ylx)| < Y^ 3~''^{y^,Axi)\ < 3tV^, 

i,j>0 

which means that 

Pr (pll > 3tV^) < 9™+^e ^ . 

Now select t to satisfy jij ln9 < ^, where m = (1 + 5)A;. Concretely, let 



, 2 + 5, 

t = 2qJ2- -ln9. 

^' 1 + 5 
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With this choice, we can take 



I8q 2 + 6 

02 = 3-— -In 9. 
1+0 
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Theorem 2.4 (Tail bound for eigenvalues of G(n,p)). Let G be a graph from G{n,p) withp £ (0, 1) 
and let Xi, . . . , Xn be the eigenvalues of G's adjacency matrix. Then for every i>2 and every ^ > 

Pr (\Xi\ > {2y/p{l-p) + C)V^) < exp{-fn/32) 



Proof. Fiiredi and Komlos prove in [9, §3.3] that E (niaxj>2 |Aj|) < 2y^p{l — p)^/n{l + o(l)). As 
before, the theorem is derived by using a tail bound from [T] . D 

We now state the following theorem from |13t Thm 3.3] (a slight generalization of |12]). spe- 
cialized to the case of the sub-gaussian random variables Xp. 

Theorem 2.5. ^13] For any p G (0, 1), 6 > there exist constants a = a{p,6) > 0,(3 = (3{p) > 
such that the following holds for all sufficiently large k and every w G M™.' 



Pr 



3v £ S s.t. WQv — w\\2 < a\/rn < exp(— /3m 



where m = {1 + 5)k and the probability is taken over Q ~ Mmxk{p)- 

Strictly speaking the results in [13] only apply for symmetric random variables (and thus only 
to Xii2 in our setting). Note, however, that the proof of the stated bound in fTS^ is based on 
three ingredients: (i) The exponential tail bound on the operator norm from Theorem |2.2[ (ii) A 
small-ball probability estimate that is based on the Paley-Zygmund inequality; and (iii) A small- 
ball probability estimate, based on a Berry-Esseen-type inequality. We note that step (iii) does not 
require a symmetry assumption (see, e.g. jSHl §2.1]). 

In order to derive explicit bounds on a and /?, we now give a simpler proof of |13l Thm 3.3] 
(which proof can essentially be read off from [13]) that works when p and 5 are large enough, 
and uses only (ii). For instance, we get non-trivial results for random graphs in G{n,p) when 
p G [0.18, 0.78]. In general, this does not require 6 to be small; for instance, the simpler proof yields 
positive values for a and /3 when p = 1/2 and 1 + 5 > 1000, which is sufficient for the applications 
in Section [3] 

These estimates are only used in the appendix for calculating explicit upper bounds on the 
number of exceptional vertices, whereas in the rest of the paper we use the statement of Theorem 



2.5 above. Nevertheless, the proof is fairly straightforward, and carries pedagogical value for readers 



not familiar with such techniques. First, we recall the Payley-Zygmund inequality [15j. 
Lemma 2.6. For any positive random variable Z and any < A < 1 



,2my 



Pr(Z>AE(Z))>(l-A)^^2)- 
We can now prove (ii), specialized to the random variables Xp. 
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Lemma 2.7. Let Yi, . . . ,Yn be independent copies of Xp and /et ai, . . . , a„ G M satisfy ^ a? = 1, 
then for every s (£M and every < 77 < 1 



Pr 



y^^ajYi 



i=l 



>r?Vp(l-p) >C(l-r?2)^ 



where C 



pH^-p) 



128q 



^ and q = max {p, 1 — p}. 



Proof. Let X = J27=i ^«^« ~ '^- ^y independence, one has 

n 

E[X^] = Y^ aJEiV;") + s'= p{l -p) + s' 

i=l 

Also, setting Y = Y17=i ^^i^i = X + s, we have 



E[y^ 



4 / rPr 

'0 

r>oo 



y^^a-jYi 



i=l 



>t\ dt 



JO 



This inequahty follows from the subgaussian tail bound in Lemma 2.3, For Pr (^ Ojl^ > t) we use 
the result for Xp and for Pr (— ^ a.jl^ > t) we use the result for —Xp = Xi^p. Here it is convenient 

by saying that Ee*^*" < e''^*^/^ for all < p < 1. 



2.3 



to reformulate Lemma 

Using (o — b)^ < 8(a^ + 6^), we have 

E[X^] < 8{E[Y^] + s^) < 128g^ + 8s\ 

We can now apply Lemma 2.6 with Z = X"^ and conclude that 

P.(x^>A(p(i-p) + ,^))>(i-A)^Mi^i|it|;i! 

This last expression has a single local minimum at s = and tends to 1/8 as s ^^ 00. Its value 

128 ^ 8' 



when s = is — — — 



12854 
Thus we have 



C) which is at most J^g < ^ 



Pr {X^ > Ap(l - p)) > Pr {X^ > X{p{l - p) + s^)) > C(l - A)^ 
Letting rj = vA gives us the desired result. 



D 

We now present a proof of Theorem |2.5| which only works for certain values of p, 6 in order to 
calculate explicit upper bounds in the appendix. 



Proof of Theorem 2.5 holding only for certain values of p, 6. We are seeking an upper bound for 

Pr (Bx G S''~^ s.t. Il^lx - w\\2 < a^fm] 



for A r^ Mjjixkip)- We separately estimate this probability depending on whether H^H > ai^/m 
or not, where ai is the constant from Theorem 2.2 An upper bound on the probability that 
ll^ll > al^/rn is given in Thm. 2.2 For the complementary case we use an 7 -net N on S^~^. Let 
X G N and w G M"*. Let fi = \ ^j=i ^ijXj — Wi\ for every i < m. By Lemma 



2.7 



we have 



Pr(/i>r?)>C 1 



p(i-p) 



Define b = C I 



pO--p) J 
Let us bound the probability that \\Ax — w\\^ < c^m for some constant c to be chosen later. 



Pr m - ^ 



Ft {\\Ax - wf < c^m) = Pr ^ // < c^m 

= PM exp (^rm- J f;/2J>l 
< IE(exp(rm-jf;/n; 



1=1 



i=l ^ ^ 



for every r > 0. 
For every i < m 

e(.xp(-^ 



jyi: 



< 



exp( -^f 1 >t\dt 
dt+ I (1 - 6)di 



e-rriVc^ + (1 _ 5) I'l _ e-rr,Vc2^ = 1 _ ft A g-r'jVc^^ 



SO we have 



Pr ( II Ax - wf < c^m) < e™ f 1 - 6 f 1 - e"^^"/^' 



< e 



Tm-b(l-e~^^'^/'''^\i 



This expression is minimized for 



r = ^T In ^^ 



For T to be positive (as it should), we must have c < Vbi] = \fC (1 — ^ _ ■, J ?]. Letting r/ 

p(i-p) . 
3 gives 

, 4C 
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Let 



c = (l 



If-^ 



9 



for some < 9 < 1, which gives 



T = ^{l-e)^Cln ^ 



Using these values for r/, 6, r and c gives 

Pr {\\Ax - wf < c^m) < exp ( -— [l + (1 - 6f (21n(l - 0) - 1)] m ) 

The size of an 7-net on S is at most (1 + 2/7) < ( ^ I assuming 7 < 1. Repeating this 
argument for every x €z N and using the union bound we get that 

4C 



Pr (3x E N s.t. \\Ax — w\\2 < c\fm\ < exp 



[1 + (1 - ef (2 ln(l -e)-l)]m + kln "^ j 



4C7 



exp - — [1 + (1 - 9f (2 ln(l - 0) - 1)] - - 



In^ 

7 



+ 5 



m 



Now we seek a constant a such that if (i) ||A|| < ai-^/m, and (ii) there exists x E S with 
ll^x — w||2 < a^/rn^ then 3x' G A^ with ||Ax' — tt;||2 < c^pm. If x' S A^ is chosen so that ||j; — x'||2 < 7, 
then 

ll^x' — i(;||2 = \Ax — w ^ A{x^ — x)\2 < \\Ax — w\\2 + \\A{x' — x)\\2 < a\/m + 'yai\/m . 



By Theorem 2.2 we have Pr(||A|| > ai^/m) < e '^^™', so taking 



and 



2 p{l-p)C 
a = (l-^)3Y ^ 7«i 



/? = minU2,— [l + (l-0)2(21n(l-0)-l)]-^j 



gives us the desired result. 



(1) 



(2) 
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This preceding argument yields the conclusion of Theorem 2.5 as long as there exists a choice 
of 7, G (0, 1) such that both a and /3 are positive. These bounds are used in Appendix A to show 
that our techniques, if analyzed in gory detail, yield reasonable bounds on the number of nodal 
domains for various values of p. 



The next result follows from taking a union bound in Theorem 2.5 



Corollary 2.8. Maintaining the notations of Theorem 2.5, it is moreover true that 



Pr 



3c G M, 3t; G S'' ^ s.t. \\Qv - cw\\2 < a^/m < exp(-/?m) 



for all sufficiently large k, and every w G 
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Proof. We may assume that \\w\\2 = 1. Since clearly HQf II2 < "m, it suffices to prove the bound for 



c G [—2m, 2m]. Apply Theorem 2.5 for every integer j £ [—2m, 2m] and a union bound to conclude 
that 



Pr ( 3 an integer j G [—2m, 2m] and v £ S s.t. \\Qv — jw\\2 < a^/m\ < exp {—/3m + In (4?7i + 1)) 



But now for any c G [—2m, 2m], let j G [—2m, 2m] be the nearest integer. For any matrix Q and 
i; G S''-^ 

WQv — CW\\2 < WQv — jw\\2 + II (j — c) w||2 < WQv — jw\\2 + 1 . 

D 

3 Nodal domains 

3.1 Eigenvectors are not too localized 

We show first that the restriction of an eigenvector to a large set of vertices must have a substantial 
£2 norm. 

Theorem 3.1 (Large mass on large subsets). For every p G (0,1) and every e > 0, there exist 
a = a{e,p) > and j3 = [5{p) > such that for n large enough, and for every fixed subset S C [n] 
of size 15*1 > (^ + e)n, 



Pr [3a non-first eigenvector f of G satisfying 
where the probability is over the choice of G ^ G{n,p). 



\S\\2 



< a] < exp(— /3n) 



Proof. Let A be the adjacency matrix of G = {V, E), and let / : ^ — > R be a non- first eigenvector 
of G with eigenvalue A. Assume w.l.o.g. that a < 2' ^^'^ let 5 C y be as in the theorem. 
For every x £ S, the eigenvector condition A/(x) = /(r(x)) implies that 



y&V 



|A/(x)|. 



Or equivalently. 



Y^{p-A,y)f{y)-pY,f{y) 

yeV y&V 



|A/(x)|. 



Squaring and summing over all x G S" this yields 



xeS 



Y^(p-A,y)f{y)-pY,m 

yeV y&V 



lAMl/lslli- 



(3) 



Let us define M = pJ—A, where J is the nxn all ones matrix, and let B be the |5| xn sub-matrix of 
M consisting of rows corresponding to vertices in S, then (|3| implies that \\Bf—pfl\\2 = |A| • 11/15112, 
where /= (/, 1). 
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Furthermore, if we decompose B = [P Q] where P contains the columns corresponding to 
vertices in S, and Q the others, then clearly P ~ M^g^{p) and Q ~ M|5'|x(n-|S|)(p)- (It is useful 
to think of P as corresponding to the subgraph of G induced by S, whereas Q corresponds to the 
bipartite graph corresponding to S and its complement). Since Bf = P{f\s) + Q{f\^), we can 
write 

WQifls) -PfMh < \\Bf - pflh + WPflsh = |A| • II/I5II2 + \\P{f\s)\\2 (4) 



The main point now is that, by Corollary 2.8 Q (a random rectangular matrix with a// entries being 
i.i.d. copies of Xp) is extremely unlikely to map /|^ near the 1-dimensional subspace spanned by 1. 
But we know from concentration results that if f\s has small 2-norm, then the RHS will be small. 
This leads to a contradiction. 



Now, since / is a non-first eigenvector, by Theorems 2.4 and 2.1, there exist constants C 
C{p),P' = 13' {p) > such that 



Pr [|A| + ||P|| > C^/n] < exp(-/3'n). 



(5) 



2- 



If we assume that |A| + ||P|| < C^/n and also ||/|s||2 < a < 5, then a = II/I5II2 = Vl — a^ > 
In this case, (H| implies that 

\f\s)-^Jl\\2<2CV^a. (6) 

for any 6 > j^^ > 0, there exists a, /3 > (with /? 



Now let k 



n 



\S\. By Corollary 



2.8 



depending only on p) such that for n large enough, 

s.t. WQv 



Pr 

•"*4(i+«)xfc(p) 



3veS''-\3cG 



cl\\2 < 2Ca-v/n < exp(— /3n 



(7) 



but this implies that ([6| and |A| + ||P|| < C^/n occur with probability at most exp(— /3n). Taking 
a union bound over (ITFand d5|, we see that for some q > 0, 

Pr[||/|5||2 < a] < exp(-/3n) +exp(-/3'n), 

where both /?, /3' > depend only on p. This completes the proof. D 

We record the following simple corollary. 

Corollary 3.2. For every p G (0, 1) there exist r = r{p) > and e = e{p) with < e < 2 such that 
for almost all G ~ G{n,p) and every subset S C V{G) with \S\ > (5 + e)n we have II/I5II2 > r. 

Proof. We need to take a union bound over all subsets S C [n] with \S\ > (2 + s)^. But for every 
value P = j3{p) from Theorem 3.1, there exists an e < 2 such that the number of such subsets is 
o(exp(/?n)), hence the union bound applies. D 



3.2 Bounding the number of exceptional vertices 



We now turn to prove Theorem 1.2 Fix some p G (0,1) and consider G(y,E) ~ G{n,p). We 
concentrate on the part of the theorem that concerns weak nodal domains; exactly the same proof 
works for strong domains after we delete the vertices at which an eigenvector vanishes (which do 
not contribute to the count of strong nodal domains). 



Let r = r{p) and £ = e(p) be chosen as in Corollary 3.2 In everything that follows, we assume 



that n is sufficiently large. By Theorem 2.4, it almost surely holds that every non-first eigenvalue 
A of G satisfies |A| = 0{y^p{l — p)n). We thus can and will assume that this bound holds for the 
remainder of this section. 



12 



Lemma 3.3. Almost surely every non-first eigenvector f : V^MofG satisfies \{f, 1)| < 0(1/ ^/p). 

Proof. As usual, A is the adjacency matrix of G, A is the eigenvalue of / and J is the all-ones 
matrix. Let M = pj — A. Then, 

A(/, 1) = (/, ^1) = {f,pJl) - if, Ml) = pn{f, 1) - (/, Ml). 

It follows that 



|(/, 1)1 = K/'^i)l < ll^-^ilb < V^\\M\ 



pn — X pn — X pn — X 



But since M ~ M^^{p), we know that ||M|| = 0{y/p{l — p)n) almost surely. Therefore almost 
surely we have 

K/,i)|< Q(-^a^ ^,^/i 

- pn-0{^il-p)n) - \Vp 

D 

We now derive an almost sure bound on the number of weak nodal domains. To this end, for any 
function / : 1/ — > M, we let Vf and A// be the largest non-negative and non-positive domains in / 
(with respect to the random graph G). Define £f = V\{VfL)Mf), and let Zf = {v & V : f{v) = 0}. 
In particular, observe that £f f] Zj = since we are discussing weak domains. Although the 
next lemma is stated in terms of nodal domains, it should be clear that it is, in fact, a simple 
combinatorial observation about random graphs. 

Lemma 3.4. For any f : V ^ M, if Di, . . . , Dm are the weak nodal domains in £f, then almost 
surely m = 0{p^^ logn) and \Di\ = 0{p^^ logn) for every i G [m]. 

Proof. If i-*i, P2j • • • 1 -Ps are non-negative nodal domains, then selecting one element from every Pj 
yields an independent set of size s. By a standard fact (which follows immediately from a union 
bound) about graphs in G{n,p), almost surely s = 0{p~^ logn). Furthermore, since each |Pj| < \Vf\ 
and there are no edges between the two sets, almost surely \Pi\ < 0{p~^ logn). The same holds for 
the non-positive nodal domains in £f. D 



We are now ready to complete the proof of Theorem 1.2 We will use the following straightfor- 
ward fact about G{n,p). 

Fact 3.5. For any fixed k £N, the following holds almost surely over the choice G ~ G{n,p). For 
any xi,X2,... ,Xk € V, \T{xi) U ••• UT{xk)\ = (!-(! -p)^ ± o(l))n and \T{xi) Pi ••• nr(xfc)| = 
(p^ lb o(l))n. 

Lemma 3.6. It almost surely holds that every non-first eigenvector satisfies \£f\ = Op{l). 

Proof. Since we seek a result that holds asymptotically almost surely, we take the following liberty: 
Any property that holds for almost every graph in G{n,p) and is needed in the proof is assumed 
to hold for G. 

We can assume that / has a consta nt sig n on £f, for i( x,y £ £f satisfy /(x) > > /(y), then 
also r(x) nr(y) C £f. But from Lemma 3.4 \£f\ = 0(log n), whereas |r(x) nr(y)| = {p^ -|-o(l))n 
by properties of random graphs. 
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Let k be an integer such that ^J^_ \k < 1- Let {xi,X2, • • • ,Xfc} C £f and assume without loss 

of generahty that /(xj) < for i = 1, . . . ,k. This imphes that [Pf ^ £f\ > (1 — (1 — p) — o(l))n 
(because this set contains the union of the neighborhoods of xi, X2, . . . , x^). Therefore |A// \2f\ < 
((1 — p) + o(l))n. By Lemma |3.3[ we have 



By Markov's inequahty, we know that there exists a subset S '^Vj U £f such that 

-+e 

\S\ > i_n_p)fc l^/ U <S/|, and for every y e S, 



-(i-p) 

V((l-p)'= + o(l)) 



/(y) < 



n 



< 



< 



1-T:fe)l^/U^/I 
(l-(l-p)fe) V((l-p)fe + o(l))n 

V((l-p)'= + o(l)) 



Consequently, 



a - e - {1 - p)'')V^ 



v/((l-p)'= + o(l)) / V/((1-P)'= + «(!)) 



sib < V^/U^/lTf^V^^— ^r^ < 



(i-e-(l-p)'=)y^- l-e-{l-py 



Corollary 3.2 yields that II/I5II2 > r = r(p). It follows that 

^((l_p)fc + o(l)) 



1_£_(1_P)A 



> ^- (8) 

We conclude that A: = O ( - log I —^ — r J J , which finishes the proof since r, e can be chosen depend- 
ing only on p. D 

3.3 Future directions 

First, we suspect that the following question should not be too difficult to resolve. 

Conjecture 1. For every fixed p £ (0, 1), for G ~ G{n,p), almost surely every eigenfunction f of 
G satisfies 

{v G V{G) : f{v) = 0} = 0. 

For simplicity in what follows, we only discuss weak nodal domains. As before, Vj and A// 
are the largest non-negative and non-positive weak domains, and £f = V \ (Vf U A//) is the set of 
exceptional vertices. We observe that if sufficiently good lower bounds on |A//| and \Vf\ hold, then 
the number of exceptional vertices is at most one. We illustrate this for p = 2, but the extension 
to general p is straightforward. 
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Lemma 3.7. Suppose that there exists an eq > such that almost surely, every non-first eigenvector 
f of G ^ G{n,2), satisfies |'P/|,|A//| > (4 + eo)n. Then almost surely every eigenvector has at 
most one exceptional vertex. 

Proof. Almost surely, every pair of vertices x,y ^ '^{G), satisfies |r(x) LlT{y)\ > (| — o(l))n. 
Assuming the stated lower bound on {Vfl and |A//|, there must be an edge from {x,y} to both Vf 
and A//. But if x and y are both exceptional, then / must have the same sign on x and y (see, e.g. 
the proof of Lemma 3.6), implying that there cannot be both types of edges. D 



Next, we show that if one can obtain a slightly non-trivial upper bound on 



for eigenvectors 



/ of G{n,p), then almost surely all eigenvectors of such graphs have at most O(-) exceptional 
vertices, and e.g. at most one exceptional vertex for p £ [0.21,0.5]. First, we pose the following 
natural problem. 



Question 2. Is it true that, almost surely, every eigenvector f of G{n,p) has 
natural guess would be that for almost all graphs, every eigenfunction satisfies 



n 



0(1)? The 

o(l) 



Very recently we learned that Tao and Vu (22| Prop. 58] have made significant progress on 
this question by showing that it holds for all but o{n) of the eigenvectors. The preceding question, 
though, is still open. A positive answer yields more precise control on the nodal domains of G{n,p). 



Theorem 3.8. Suppose that for almost every G ~ G{n,p) it holds that all eigenvectors f of 
G satisfy \\f\\oo = o(l). Then almost surely every eigenvector has at most kp = [-^ ^ 
exceptional vertices. 



-log2(l/{l-p))- 



In order to prove this statement, we need the following strengthening of Theorem 3.1 In 
particular, we require that the subset of vertices S be allowed to (weakly) depend on the random 
choice of G ~ G{n,p). 

Theorem 3.9. For every p £ (0, 1), and e > 0, there exist values a = a(e,p) > and (3 = (3{p) > 
such that the following holds. Suppose G ~ G{n,p) and A is the adjacency matrix of G. Suppose 
further that S C [n] is a (possibly) random subset which is allowed to depend on the rows of A 
indexed by a set T C [n] with \T\ = o{n). Then for all sufficiently large n, we have 

Pr [3a non-first eigenvector f of G with II/I5II2 < Q;||/|y\Tll2 O'^d \S\ > (| + e)n] < exp(— /3n). 



Proof. Let A be the adjacency matrix of G = {V, E), and let f : V 



be a non-first eigenvector 



of G with eigenvalue A. Let S CV he a possibly random subset with \S\ > (^ -\-e)n. Let T C [n] be 
such that \T\ < o{n) and S is determined after conditioning on the values of A in the rows indexed 
by T. Assume, furthermore, that II/I5II2 < "||/Iv\tI|2- 

Again, for every x G 5, the eigenvalue condition A/(x) = /(r(x)) implies that 



Y, A,yf{y) + Y,A.yf{y) 
y&V\T y&T 



|A/(x)|. 



Or equivalently. 



E ip-A-y)fiy)-p E f{y) + T.^-yf^y^ 

yeV\T yeV\T yaT 



|A/(x)|. 
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Squaring and summing over all x £ S \T yields 

ip-A.y)f{y)-p Y. f(y) + Y^A,yf{y)\ = \X\' ■ \\f\s\T\\l (9) 

yGV\T y&T J 

As above we define M = pJ—A, but now we let B be the IS'VTl x |1^\T| sub-matrix of M consisting 
of rows corresponding to vertices in S" \ T and columns corresponding to vertices in V\T. Now let 

^ ^ \\f\v\Th ' *^^" ® implies that 

\\Bg — c/wrib < 2|A| • a 

where wt G M'^ ' is a vector which depends only on the rows of A indexed by T and c/ G M is some 
constant depending on /. Note that we have used the fact that ||/|s'||2 < c^ll/lyxrlb- Furthermore, 
B and S are independent random variables conditioned on T. From this point, the proof proceeds 



just as in Theorem 3.1 D 



Proof of Theorem 3.8. Our goal is to show that almost surely \£f\ < kp for every non-first eigen- 
vector / with associated eigenvalue A. Suppose, to the contrary, that \£f\ > kp, and let U ^ £f 
have \U\ = kp + 1. Consider F = UueC/''^(^)' -^^ properties of random graphs, it holds that 

in >[!-(!- pp+^ - o{l)]n > (2 + ep - 0(1; 

for some positive £p. Thus for n large enough, we may assume that indeed |F| > (2 + £p) 



n. 



Again, by Theorem 2.4 we have |A| = Op{y/n) almost surely, and thus we assume this bound 
holds for the remainder of the proof. Now for each u € U, let Du be the nodal domain of u with 
respect to /. Using the eigenvalue condition, for every u £ U, we have |/(F(u))| = |A/(u)| = 
Op{y/n)\f(u)\. Every neighborhood F('u) has non-trivial intersection with only one of Vf or A//, 
hence grouping terms by sign, we have 

Y. \f{x)\ < \f (T{u)n{VfUAff)) + f (T{u)n£f\D.,)\ + \f{D^)\ 
xer(«) 

< |/(F(n))| + |Z)„| 

< Op(V^)|/(n)|+0(logn), 



where we have used the estimate on |Z)„| from Lemma |3.4| which holds almost surely. 
In particular. 



Y E \f{x)\<{kp + mf Woo ■Op{V^) + 0{kplogn). (10) 

Now let T' = {x £ T : |/(x)| < -y=} for some c = c(n) > to be chosen momentarily. Using 
(M, we see that |F \ F'| < Ofev^yMI/ll°° . Under the assumption |A| • ||/||oo = o{^/Ti), we can 



choose c = 0(1) so that |F \ F'| = o{n), in which case we may assume that for n large enough, 
|F'| > (5 -|- £p) n for some e'p > 0. We have ||/|r'||2 = o(l), hence also 

||/|r'||2 = o(l)-||/|ywll2> (11) 
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since ||/|c/||2<V|C/|-||/||oo = o(l). 

Since T' depends on / and not just on the rows of the adjacency matrix corresponding to U, 
we cannot directly apply Theorem |3 .Of Instead, we will apply the theorem to a collection of sets U 
which are determined by T (which is determined by the sets {T{u)}u£u)i with the guarantee that 

r' eu. 

To this end, let y{n) = o{n) be an upper bound on the size of |r \ r'|, and consider 

U = {W ^r:\W\> \r\-y{n)}. 

Then, we have \h(\ < („f^\), T' S U, and the collection U is completely determined by the rows of the 
adjacency matrix of G corresponding to the vertices in U (as it is determined by T = IJu^i/^iu))- 
We may enumerate U = {Ui,U2, ■ ■ ■} in such a way that each Ui is determined after conditioning 
on r (simply by canonically ordering all subsets of the vertices, and taking the induced ordering 
onU). 



We can thus apply Theorem 3.9 to each of the („J^-)) sets Ui ^lA (one of which will always be 



the set r') and take a union bound to obtain, for some [3 = (3{p) > 0, 

Pr[3^ s.t. ||/|r'||2 = 0(1) • \\f\v\uh and |r'| > (^ +4)n] < (^ "^ i) (y(''^)) exp(-/?n), 
and the latter quantity is o(l) since y{n) = o{n), but this contradicts (|11[). D 



Remark 3.1. Observe that even under the preceding assumptions, we are not able to rule out the 
case of one exceptional vertex v with, say T{v) = A// and "Pf = V \ (A// U {v}). 



APPENDIX 

A A few examples 

We used a simple MATLAB program to compute an upper bound on the constant behind the Op(l) 



^TT 



term in Lemma 3.6 for various p's. It calculates the largest k suc h that inequality (l8|) holds, using 

2.2 and ([I]) and (|2]). Our bounds for 



r = —2D — ' and the explicit bounds computed in Theorem 
various p's are given in the following table. 
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p 


k 


0.78 


29 


0.74 


30 


0.7 


32 


0.66 


34 


0.62 


37 


0.58 


39 


0.54 


43 


0.5 


46 


0.46 


54 


0.42 


63 


0.38 


75 


0.34 


90 


0.3 


109 


0.26 


137 


0.22 


181 


0.18 


277 
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